Nvidia and Researchers Unveil Smarter Training Method for Game-Playing AIs

BTCC / BTCC Square / Global Cryptocurrency /

Author:

Published:

2025-06-19 13:13:01

Nvidia, alongside researchers from the Politehnica University of Bucharest and Mila Quebec AI Institute, has developed a breakthrough reinforcement learning technique called Macro-Action Similarity Penalty (MASP). This method accelerates AI training by identifying similarities between macro-actions—bundled sequences of decisions—enabling more efficient learning in complex environments like video games and robotics.

The MASP approach outperformed established benchmarks such as RAINBOW-DQN in game testing, demonstrating superior adaptability in titles like Breakout and Street Fighter II. While the technique shows promise for applications in autonomous systems and adaptive gaming AI, its computational overhead and dependency on well-designed action sets present implementation challenges.

By:

Oil Stocks Surge Amid Escalating U.S.-Iran Tensions

AICoin AI: Narayana Murthy Backs AI as Key Driver for India’s $250B Tech Industry

|Square

Get the BTCC app to start your crypto journey

Download on the App Store GEI IT ON Google Play

Get started today Scan to join our 100M+ users

Recommended

Promotions

Nvidia and Researchers Unveil Smarter Training Method for Game-Playing AIs

|Square